GDBR: An Optimal Relation Generalization Algorithm for Knowledge Discovery from Databases

نویسندگان

  • Colin L. Carter
  • Howard J. Hamilton
چکیده

We present GDBR, Generalize DataBase Relation, an optimal, on-line O(n) algorithm for database relation generalization using concept hierarchies. The algorithm is a variant of attribute-oriented induction which generalizes database relations in either O(n log n) or O(np) time, where n is the number of input tuples and p is the number of tuples in the output relation. We augment the concept hierarchy structures to provide inherent generalization, define an encounter order on input concepts, progressively generalize input concepts as the need arises, and structure the final generalized relation in such a way to allow tuples to be inserted in one step.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Attribute-Oriented Algorithms for Knowledge Discovery from Databases: Extended Report

Practical tools for knowledge discovery from databases must be efficient enough to handle large data sets found in commercial environments. Attribute-oriented induction has proved to be a useful method for knowledge discovery, but two algorithms which implement it, AOI and LCHR, have some limitations due to excessive memory usage. AOI and LCHR generalize database relations in O(np) and O(n log ...

متن کامل

Discovery of General Knowledge in Large Spatial Databases

Extraction of interesting and general knowledge from large spatial databases is an important task in the development of spatial dataand knowledge-base systems. In this paper, we investigate knowledge discovery in spatial databases and develop a generalization-based knowledge discovery mechanism which integrates attribute-oriented induction on nonspatial data and spatial merge and generalization...

متن کامل

Attribute-oriented Induction in Ob Ject-oriented Databases

Knowledge discovery in databases is the nontrivial extraction of implicit, previously unknown, and potentially useful information from data such that the extracted knowledge may facilitate deductive reasoning and query processing in database systems. This branch of study has been ranked among the most promising topics for database research for the 1990s. Due to the dominating influence of relat...

متن کامل

Knowledge Discovery in Fuzzy Databases Using Attribute-Oriented Induction

In this paper we analyze an attribute-oriented data induction technique for discovery of generalized knowledge from large data repositories. We employ a fuzzy relational database as the medium carrying the original information, where the lack of precise information about an entity can be reflected via multiple attribute values, and the classical equivalence relation is replaced with relation of...

متن کامل

Knowledge Discovery in Databases : A Rule - Based Attribute - Oriented ApproachDavid

An attribute-oriented induction has been developed in the previous study of knowledge discovery in databases. A concept tree as-cension technique is applied in concept generalization. In this paper, we extend the background knowledge representation from an unconditional non-rule-based concept hierarchy to a rule-based concept hierarchy , which enhances greatly its representation power. An eecie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994